Unfolding protein with an atomic force microscope: 
Force-fluctuation induced non-exponential kinetics 
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We show that in experimental atomic force microscopy studies of the lifetime distribution of 
mechanically stressed folded proteins the effects of externally applied fluctuations can not be dis- 
tinguished from those of internally present fluctuations. In certain circumstances this leads to 
artificially non-exponential lifetime distributions which can be misinterpreted as a signature of pro- 
tein complexity. This work highlights the importance of fully characterizing and controlling external 
sources of fluctuation in mechanical studies of proteins before drawing conclusions on the physics 
at play on the molecular level. 



I. INTRODUCTION 



Atomic force microscopy (AFM) has emerged as a pow- 
erful tool with which to study biologically relevant sys- 
tems [U [2] . Probing the response of a protein to an ap- 
plied mechanical force allows for the direct investigation 
of the physical properties of the protein [3J 0] . Recent 
experiments have used lifetime distributions of a single 
folded protein under constant applied mechanical force 
to probe physics at the molecular scale [5J [B]. To do 
so, a folded protein is put under tension and the time 
to unfolding is measured. Many such individual experi- 
ments are collected (often over the course of days or even 
weeks) and their lifetimes combined to estimate the life- 
time distribution. Such lifetime distributions have been 
studied experimentally for both Ubiquitin and the 27th 
domain of immunoglobulin (127) 7, 8 . In some instances 
non-exponential behavior has been reported in measure- 
ments of Ubiquitin [TO] , and associated with glassy dy- 
namics [9 . Static configurational disorder of the folded 
protein structure combined with the Bell model |llj has 
recently been proposed as a plausible explanation for the 
observed stretched-exponential unfolding time distribu- 
tions [HE]. 

Non-exponential lifetime distributions in physical sys- 
tems are often a signature of underlying complex pro- 
cesses with multiple timescales operating at the micro- 
scopic level p~5l417| . Characterizing the range of rates 
present can provide insights on the fundamental physical 
mechanisms driving the decay process. However, non- 
exponential distributions can also arise in systems that 
are characterized by a single timescale, if external fluc- 
tuations are able to propagate through non-linearities of 
the system. Thus, if non-exponential behavior is to be 
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FIG. 1: (Color online) Schematic representation of a typical 
AFM setup for protein measurements. The deflection of the 
cantilever is measured as a differential voltage on the quad- 
rant photodetector. In order to achieve a force measurement 
one must convert the voltage measured on the photodector 
into a displacement of the tip and then measure the spring 
constant of the cantilever (typically done using the thermal 
tune method |14|h 



taken as proof of complex internal workings the effects of 
external fluctuations must be understood. 



II. RANDOM BELL MODEL 

In his model for cellular adhesion [IT], G.I. Bell needed 
a relationship between rate and mechanical force. He 
used a phenomenological relation that was found by 
Zhurkov and Kuksenko in a study of crack propagation 
in stressed bulk polymers [18]. In analogy with the Ar- 
rhenius relation at a given temperature T, this relation 
postulates an exponential relationship between the un- 



2 



folding rate a and the applied external force F given by 
a = a e^ Fx \ (1) 

where ao is the unfolding rate at zero applied force, 
[3 = 1/ksT and is a model parameter with units of 
length, usually interpreted as the distance to the tran- 
sition state [T§1 - |2"T] . This relationship was subsequently 
referred to as the "Bell model" and applied to the dy- 
namic strength of molecular bonds [19] and molecular 
measurements of protein unfolding by AFMs [22 . This 
emperical relation is an approximation of the mean es- 
cape time from a well (Kramers' problem). The general 
solution to such a problem depends on the details of the 
energy landscape and need not take a simple exponential 
form. Deviations from this approximation will manifest 
as a force dependent prefactor tto(f) and a force depen- 
dent x*(/) even in the absence of a complex or rough 
energy landscape [2"5] . 

In the application of this model to protein measure- 
ments configurational disorder enters as a source of fluc- 
tuation for x* and is sufficient to explain experimental 
results OH!]. In principle, however, one could consider 
the effects on the lifetime distribution of fluctuations in 
all of the parameters, a ,f3,F and x*. 

Fluctuations in ao or x* would provide a window into 
the physics on the microscopic scale [ElllH], whereas fluc- 
tuations in P and F simply arise from the experimental 
conditions and provide no insight into protein physics. 

We introduce the terminology Random Bell Model 
to indicate a model similar to the Bell Model (Eq{T]) 
in which each parameter is assumed to be randomly 
distributed. We will probe this model to determine the 
impact of various sources of fluctuations on the resulting 
distributions of rates and lifetimes. 
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FIG. 2: (Color online) Probability density of rate P{a), given 
by Eq. presented as a Log-Log plot. The curves corre- 
spond to values of j3Fx^ = 6.11 and «o = 10 — 3 s -1 , as given 
in reference [13 , for a fixed value of cr^ = 0.2, but varying 
values of ((T^,crj) = (0,0.2) in black solid line, (0.05,0.19) in 
orange dotted line, (0.1, 0.17) in red dashed line, (0.15,0.13) 
in purple dash dot line. Note that (tr^, crj) = (0, 0.2) leads to 
a log-normal distribution of unfolding rates, represented by a 
parabola in Log-Log scales. 



of fluctuations in force are present. Readily visible are 
the fluctuations in the force value observed over a single 
experimental trace. These fluctuations include 

1. Thermally induced fluctuations |24j in the position 
and curvature of the cantilever tip as well as the 
position of the sample. 

2. Fluctuations induced at high frequencies by the 
force feedback system (setup dependent). 



III. SOURCES OF FLUCTUATIONS 

Most experiments are done at room temperature, with- 
out strict thermal regulation, leading to fluctuations in 
temperature. However, even in a poorly controlled envi- 
ronment, the upper bound for thermal fluctuations will 
be on the order of 5°, leading to fluctuations in /3 of less 
than 2%. For the sake of brevity we will not consider 
these fluctuations as being significant. By definition, ctQ 
is the rate at temperature T and zero applied force, and 
should be inherent to the folded protein structure. We 
will therefore assume that a docs not fluctuate within a 
given protein, nor from one molecule to the next. Note 
that if it proved relevant, it would be straightforward 
to extend the following results to include fluctuations in 
both /3 and a Q . 

This leaves us with sources of fluctuations in F and x* 
to consider. As previously mentioned, fluctuations in x^ 
are the physical quantity of interest which may be masked 
by fluctations in F. In current experiments many sources 



3. Mechanical vibrations transmitted through the 
AFM to the cantilever tip or the sample (setup de- 
pendent) [25] . 

The combined magnitude of these fluctuations has been 
estimated to be ~ 10% for the apparatus in [10]. Less 
apparent, but likely more significant, are the fluctuations 
from experimental run to run induced by the calibration 
of the system (see Figure [I]) . These fluctuations include 

1. Errors in the calibration of the spring constant of 
the cantilever (estimates vary from 10% [55] to 20% 
[27] for the generally used thermal tune method of 
calibration). 

2. Time dependent drift in the spring constant of the 
AFM cantilever (on the order of 14% in 60 minutes 

E3). 

Therefore, a worst case estimate of the overall magnitude 
of the force fluctations would be between 20% and 25%. 
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IV. UNFOLDING RATE AND TIME 
DISTRIBUTIONS 

A. Unfolding rate distribution 

Let F and x* be random variables with respective 
means F and x* and standard deviations up and a x %. 
We introduce dimensionless random variables <f> and £ 
such that 



F = 0F, 
x* = £x*. 



(2) 
(3) 



These two random variables are described by the joint 
probability density J(4>, £). By definition the probability 
density of the product uj = 0£ is given by 



il,(w) 
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d<f>dZJ(<t>,Z)8(u>-(f>Z). 



(4) 



To simplify the previous relation, let's perform the fol- 
lowing change of variables, 



u 2 = 



whose Jacobian is 
3 = 



1 



(5) 
(6) 



(7) 



Then Eq|4] becomes 

r+oo 

Pa,(uj) = / du 1 du 2 



1 

1*1 1 



J( m,— J 5(u-u 2 ). (8) 
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The distribution of their product uj = 0£ is then given 
by the Rohatgi formula 28 



+ 00 



1*1 



(9) 



This expression of P w does not require ^ and £ to be 
independent, nor of any particular distribution. To pro- 
ceed further, however, we will make the experimentally 
reasonable assumption that the variables £ and <f> are 
two independent Gaussian variables, with mean 1 by 
construction and standard deviations — a x t /x* and 
0<t> = &f/F. Note that we do not make any assumption 
on the relative importance of these two kinds of fluctu- 
ations. The distribution of u) is obtained by numerical 
integration of Eq(9j 

The distribution of rates a is obtained through the 
change of variables 



= ccq exp ((3F: 



4, 



(10) 



leading to the following probability density of unfolding 
rates : 
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FIG. 3: (Color online) Dependence of unfolding time statis- 
tics on the relative weight of external fluctuations (a"^) and 
internal fluctuations (crj) for a fixed value of total fluctua- 
tions (<x w = 0.20), and for fiFx % = 6.11, a = 10~ 3 s~\ All 
four plots are for varying values of (<r$, crj) = (0, 0.2) in black 
solid line, (0.05,0.19) in orange dotted line, (0.1,0.17) in red 
dashed line, (0.15,0.13) in purple dash dot line, a) Cumula- 
tive probability function of unfolding times. The grey line is 
the best exponential fit. b) Difference between each curve in 
(a) and the curve for (a^, o^) = (0, cr w ). c) Probability density 
function of unfolding times, d) Difference between each curve 
in (c) and the curve for (a^,a^) = (0, a J). The distributions 
shown in (a) and (c) are experimentally indistinguishable. 
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An example of such a rate distribution is presented in 
figure [2] We observe a systematic change in the tails of 
P{a) as the relative weight of and is varied. As 
these changes are in the tails they may prove difficult to 
access experimentally. 



B. Unfolding time probability distributions 

The directly measurable cumulative unfolding time 
distribution is given by a Laplace transform of the rate 
distribution as 

POO 

C{t) = 1 - / daP a (a)e- at . (12) 
Jo 

The corresponding probability density is then given by 

p w = f (*)■ m 

Figures [3p, and [3p show plots of the cumulative proba- 
bility distributions and the probability density functions 
for a fixed value of er u , the standard deviation of the 
product variable ui, and different values of and <7£. 
The overall shape of the distribution appears to depend 
only on cr w : The resulting distributions differ by less than 
0.1% and thus are indistinguishable for all values of <T<p 
and <7£ for a fixed value of cr u . 

If (70 very nearly equals zero (i.e. no fluctuations in 
the externally applied force) no confusion is possible: 
any deviation from a strictly single exponential lifetime 
distribution must be attributed to internal fluctuations 
as described by [13]. However this cannot be the 
case experimentally. Given these sources of fluctuation 
it is unlikely that 0$ is negligible. Consequently, a good 
estimate of must be obtained before one can hope to 
learn anything about or its origins. 



C. Skewness of the rate distribution 

We have shown that unfolding time statistics are 
largely insensitive to the origin of the fluctuations. How- 
ever, we note that the underlying rate distribution be- 
comes asymmetric as the ratio between and grows, 
as shown in Figure [2j One way to separate the sources of 
disorder would be experimentally measure the assymctry 
of the rate distribution as characterized by the skewness. 
At present such a measurement is experimentally inacces- 
sible because unfolding rates are not experimentally mea- 
surable quantities. Instead, they must be estimated from 
the observed unfolding times using statistical techniques 
with all of their incumbent limitations, drawbacks, and 
errors [51 [TO]- We include the following study of the skew- 
ness for two reasons. First, it is the dominant evidence 
of the separability of internal and external fluctuations. 
Second, we do so as a way to motivate the development 
of experimental tools to allow such a measurement to be 
effected. 



T 




0.0 0.5 1.0 1.5 2.0 2.5 3.0 u % 



FIG. 4: (Color online) Evolution of the rate distribution skew- 
ness 7 as a function of a^/cr^, for 6 different values of , col- 
ored from top to bottom as = 0.20 (black), 0.19 (purple), 
0.18 (brown) 0.17 (red), 0.15 (orange), and 0.10 (yellow) for 
/3Fx* = 6.11, «o = lCT 3 s _1 . 




FIG. 5: (Color online) Evolution of the rate distribution skew- 
ness 7(0^, o^) as a function of erg, for different values of o^, 
increasing by step of 0.025 from 0.05 (bottom black curve) to 
0.325 (top purple curve), with fiFx % = 6.11, a = 10~ 3 s _1 . 

The skewness 7 of the distribution P(a) is defined as 

... _ (( Q ' (i»S 

7 ((a -a) 2 ) 3 / 2 ' 1 ' 

where a is the mean unfolding rate and (• • • ) stands 
for the average over P a . Figure [4] presents the evo- 
lution of the skewness 7 by varying the ratio of and 
<7£ for several fixed values of cr w , the standard deviation 
of the product uj = <f>£. We observe that the overall 
scale of the skewness grows with increasing <t w while the 
maximum, for fixed a u , always occurs when = erg. 
Note that skewness is of necessity symmetric under in- 
terchange of 0$ and <7j because of the symmetry of the 
original problem. Therefore, 7 must have a turning point 
when C0 = <T£, 

Ideally, one should first accurately estimate cr^, then 
a measurement of 7 will allow for to be read off the 
graph presented in Figure [5j Perhaps more realistically, 
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one can start with a fairly poor estimate of er^ and then 
measure 7 in a series of experiments in which additional, 
controlled, amounts of force fluctuation are injected into 
the experiment to allow for a differential measurement of 

V. CONCLUSIONS 

In this article we analyzed the effects of both internal 
and external fluctuations on unfolding time statistics in 
the context of the random Bell model. We highlighted the 
extremely weak effect these different origins have on the 
measurable unfolding time distributions, as illustrated in 
Fig|3] Thus it is difficult to attribute non-exponential ki- 
netics solely to the physics of a protein. This observation 
ultimately comes from the fact that lifetime distribution 
measurements rely on the assumption that the unfolding 
times are independent and identically distributed (HD). 
However, in the presence of external fluctuations this as- 
sumption may no longer be justified, particularly if data 
is gathered over the course of multiple days or weeks us- 
ing multiple AFM cantilevers with multiple calibrations. 
Even if the underlying behavior of a protein were per- 
fectly exponential such a lumping together of the data 
will yield non-exponential lifetime distributions. Statis- 
tical analyses, such as the maximum likelihood estimator 
[29j . might provide a way to relax the assumption of IID, 
at the cost of some arbitrariness in the choice of the like- 
lihood function [30] . 

In this article we have focused on the application to 
constant force protein measurements, however, these ar- 
guments can be equally well applied to other afm exper- 
iments. As an example, fluctuations in the calibration 



of experimental setups can have significant consequences 
on landscape reconstructions using force extension mea- 
surements [51] . 

Taken as a whole, these results indicate that it is ex- 
tremely difficult to experimentally distinguish between 
internal and external sources of fluctuations by measur- 
ing lifetime distributions. External fluctuations can mask 
the more interesting internal fluctuations. As a necessary 
but not sufficient prerequisite to measuring the magni- 
tude of internal fluctuations one must first minimize and 
quantify the level of external fluctuations. Therefore, the 
internal physics will remain obscured unless AFM force 
experiments are accompanied by an estimation of the 
types and magnitudes of external fluctuations and the 
way that such fluctuations propogate through the mea- 
sured quantities. 

In light of these findings there is a need for new exper- 
imental methods in order to gain insight into the physics 
at play on the molecular scale without relying on a statis- 
tical analysis. One promising route is to avoid statistical 
analysis of collected unfolding time by focusing on pro- 
tein dynamics. In the spirit of nano-rheology experiments 
[521 155] , direct measurements of the dynamical global re- 
sponse of individual molecules appears as a promising 
route to understand how interatomic interactions con- 
tribute to the mechanical properties of a protein. 
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